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DISTRIBUTED TIME-MULTIPLEXED BUS ARCHITECTURE 
AND EMULATION APPARATUS 

BACKGROUND OF THE INVENTION 
5 The present invention concerns distributed time-multiplexed bus architecture 

and emulation apparatus, and more particularly to a time-multiplexed bus circuit and 
system that greatly reduces propagation delay in emulation or prototyping systems. 

Electronics systems generally involve the connection of two or more system 
components together via a high speed data channel. In order to save on system costs the 

10 number of data channels or data buses is usually kept to a minimum. The same data 
channel is often used to transfer data between multiple system components in either 
direction and may also be used to transfer data from one component to several other 
components in a broadcast mode. In order to optimize the overall system performance it 
is necessary to minimize the amount of time that the data channel is idle between data 

1 5 transfers and to maximize the rate at which data can be transferred. 

In a typical system, there is an arbitration logic block that determines which 
system block is allowed to initiate a data transfer at any given time. This prevents two 
different system components from transferring data at the same time on the same data 
channel and thus corrupting each other's data. Typically a system component will 

20 request the use of the data channel from the arbitration unit and when the data channel is 
free that component will be granted the use of the data channel. The system component 
can then start transferring data on the data channel. There is generally a delay between 
when the system component is granted the use of the data channel and when it is 
capable of starting an actual data transfer because the system component must 

25 recognized the grant signal and then enable its data buffers to the data bus. Li a 

synchronous system, this delay is at a minimum one clock period but can be longer. This 
leaves the data channel unused for some period of time and consequently degrades the 
overall system performance. 

IBM in its distributed multiplexer (see patent cited below) breaks the data 

30 channel interface into two pieces. One piece is a data node local to each system 

component that is used to set up the data that is to be transferred on the data channel the 
clock period prior to data actual being driven on the data channel. The other piece is a 
multiplexer component that connects to the data channel. This is done to minimize the 
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time delay to actually make the transfer, since only the time delay of the multiplexer 
component is involved (not the setup and delay of the local data node portion of the 
circuitry). In this scheme, each system component multiplexer has a data input bus and 
a data output bus. The data output bus is a logic OR of the component's internal data 
5 from the local data node piece and the input data bus of the multiplexer portion of the 
circuitry. There is then a system level bus grant signal that comes from the arbitration 
logic that is used to enable the internal data into the logic OR. Throughout the system 
the output data bus of one component is connected to the input bus of the next 
component in a daisy-chain fashion. The arbitration logic is responsible for enabling 

10 only one system component at a time. The result is that the data bus always contains the 
data driven from the enabled component without any clock delays needed for a system 
component to decode a grant signal and then sequentially enable data bus buffers. 

Such a distributed multiplexing scheme is slow when used in a system where the 
data bus is a time-multiplexed bus. This is true because in a time-multiplexed system it 

15 takes one timeslot or clock cycle to transfer data through each system component's logic 
OR. Thus for a system with five components in the bus chain, it would take five 
timeslots to transfer the correct data throughout the data channel. The current invention 
reduces the number of timeslots necessary to get the correct data on the data bus to one. 
Wired-OR buses are commonly used in computer systems today, as are 

20 distributed multiplexers, and both have been used in emulation systems. See, for 
example, U.S. Patent No. 5,425,022 assigned to British Telecom entitled DATA 
SWITCHING NODES; U.S. Patent No. 5,789,966 assigned to IBM entitled 
DISTRIBUTED MULTIPLEXER; and U.S. Patent No. 5,596,742 assigned to MIT 
entitled VIRTUAL INTERCONNECTIONS FOR RECONFIGURABLE LOGIC 

25 SYSTEMS. 

SUMMARY OF THE INVENTION 
A time-multiplexed data bus driver circuit includes a plurality of combinatorial 
circuits, each of the circuits forming a logic combination of a datum value, a datum 
30 enable signal and a datum timeslot signal, the plurality of circuits producing a plurality 
of output signals based thereon, and a wired-OR junction producing a logic OR 
combination of the plurality of output signals representing a time-multiplexed data 
stream. Preferably, for each of the combinatorial circuits the enable signal and the 
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timeslot signal are combined in an AND function to produce a gating signal for the 
datum value, one selected datum value being gated onto the wired-OR junction, at a 
given timeslot, when enabled. The common time-multiplexed data channel at the 
wired-OR junction has a pull-up device connected to a bias voltage or a pull-down 
5 device connected to a ground voltage, thereby representing in an un-driven state a 
respective logic 1 or 0. 

The foregoing and other objects, features and advantages of the invention will 
become more readily apparent from the following detailed description of a preferred 
embodiment that proceeds with reference to the drawings. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a simplified schematic diagram of the distributed time-multiplexed bus 
architecture in accordance with a preferred embodiment of the invention. 

Fig. 2 is a timing diagram illustrating the datum, datum enable signal, datum 
15 timeslot signal and output data timing in accordance with a preferred embodiment of the 
invention. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
In the current invention, a common time-multiplexed bus is used to connect all 

20 system components. However, instead using one timeslot to transfer data just between 
two nodes, the resultant data from all nodes is transferred in a single timeslot. To do 
this, all nodes are enabled at the same timeslot. The previous timeslot is used to transfer 
the value of the grant signal to all nodes. Then during the enabled timeslot the node that 
is granted the bus will drive data onto the bus and all other nodes will float the bus. 

25 This allows the transfer to be done within one timeslot. 

Fig. 1 illustrates the apparatus in accordance with a preferred embodiment at 10. 
Apparatus 10 includes plural instances of identical combinatorial circuits of which two 
are indicated at 12 and 14 within dashed outlines. Each combinatorial circuit may be 
seen to form a logic combination of a DATUM value, a datum ENABLE or gate signal 

30 and a datum TIMESLOT signal. The plurality of circuits such as circuits 12 and 14 may 
be understood to produce a plurality of output signals such as OUTPUTm and 
OUTPUTn based upon the value and timing of their inputs. A wired-OR junction 
indicated in Fig. 1 as node 16 will be understood to produce a logic OR combination of 
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the plural high-active output signals. As will be seen, this logic combination of plural 
output signals represents a time-multiplexed data stream. Those of skill in the art will 
appreciate that, within the spirit and scope of the invention, typically there may be 
dozens, hundreds or even thousands or more such wired-OR output signals. 
5 Those of skill in the art will appreciate that the term wired-OR is used most 

broadly herein to refer to a common junction of plural signals, whether representing 
high-active or low-active logic. Thus, in a high-active logic system, the junction 
produces a logic one at the output when any input to the common junction is high, 
effectively producing a logic OR of the inputs. Conversely, in a low-active logic 

10 system, the junction produces a logic zero at the output when any input to the common 
junction is low, effectively producing a logic AND of the inputs. By changing the high- 
positive (logic 1 = true) or low-positive (logic 0 = true) logic of the drive gates and data 
nodes, it will be understood that the wired-OR junction described and illustrated herein 
can function to produce either a logic AND or a logic OR function, based upon well- 

15 known logic conventions.' 

In each combinatorial circuit, the ENABLE signal, the TIMESLOT signal, the 
DATUM signal and the SYSCLK (low) signal are combined in an AND function 
represented by AND gates 18 and 20. For a given TIMESLOT, the gating signal to the 
tri-state buffer will be true only if the ENABLE and DATUM signals are true. If this is 

20 the case, the active-datum "0" on the input of the tri-state buffer will be gated onto the 
wired-OR junction 16. Since this junction has previously been precharged to a "1" by 
pre-charge transistor 30, it will stay high (which represents a "0" in active-low logic) if 
none of the tri-state buffers is enabled. However, if one or more of the tri-state buffers 
is enabled, then junction 16 will be driven to a low level (which represents a "1" in 

25 active-low logic), thus correctly performing the OR function of the distributed 

multiplexer. Since the data at junction 16 is active-low, the receiver of the data must re- 
invert that data to correctly reproduce the distributed multiplexer's output. 
Alternatively, junction 16 could be implemented in active-high logic, in which case the 
pre-charge would be to a low level and the input to the tri-state buffer would be a "1." 

30 In this case, no inverter is need at the receiver. 

A timeslot generator (or timeslot generation logic) 26 produces a plurality of 
TIMESLOT enable signals one of which, e.g. TIMESLOTj, preferably drives 
corresponding first inputs of AND gates 18 and 20. The other corresponding inputs to 
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AND gates 18 and 20 are the ENABLE or grant outputs from arbitration logic 28, which 
selectively enables only one output at a time in accordance with a system-level 
determined priority allocation scheme, as is known. 

Those of skill in the art will appreciate that the non-selected outputs are inactive 
5 and thus the corresponding AND gate's output is inactive and a corresponding tri-state 
buffer drives its output signal into a defined high-impedance state, thereby avoiding 
corruption of the selected datum value. In accordance with the preferred embodiments 
of the invention, only one selected datum value is gated onto wired-OR junction 16 at a 
given timeslot, when enabled. It may be seen from Fig. 1 that in accordance with the 

10 preferred embodiments of the invention, a plurality of timeslot enable signals input to a 
plurality of combinatorial circuits are connected in common, i.e. they are driven by a 
common output of timeslot generator 26. Thus, wired-OR junction 16 may be 
understood to be a single bit-wide, time-multiplexed data bus of channel. 

In real life, an error in the design of the arbitration logic or timeslot generator 

1 5 would produce an error in the output data. Similarly, in a verification system, an error 
in the design of the arbitration logic or timeslot generator should produce an error in the 
output data. Thus it is important in such a simulation or emulation context that errors, 
e.g. an incorrect arbitration logic function, be dutifully reproduced so that the designer 
can make design corrections before the design is committed to silicon or other physical 

20 embodiments. Those of skill in the art will appreciate that one or more such single bit- 
wide, time-multiplexed data channels may form a part of an emulation system, as such 
are well known. Those of skill in the art will appreciate that in such a field of use of the 
present invention, data nodes selection mechanism 28, timeslot selection mechanism 26 
and two or more switches such as switches 12 and 14 would form a part of the system 

25 being emulated. This is illustrated in Fig. 1 by various labels. 

In accordance with a first preferred embodiment of the invention, wired-OR 
junction 16 includes a pull-up device 30, e.g. a resistor or transistor, connected to a bias 
voltage 32 such as VCC (typically +3.3V in a digital system). In accordance with a 
second preferred embodiment of the invention, wired-OR junction 16 includes a pull- 

30 down device 34, e.g. a resistor or transistor, connected to a ground voltage 36. Those of 
skill in the art will appreciate that pulling the data bus up or down places the un-driven 
bus in a defined logic state thus enabling a wired-OR function. (The bus otherwise 
would remain in a high-impedance state at a so-called "float" voltage that is susceptible 
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of assuming either logic state, e.g. via signal cross-talk or ground noise). 

As an alternative solution, each node multiplexer can contain an OR gate as in 
the IBM distributed multiplexing scheme except that instead of daisy-chaining the data 
bus between node multiplexers, the OR gate can use a tri-state driver to drive a common 
5 data bus. The data bus must be resistively pulled up during the timeslot or pre-charged 
prior to the timeslot in which the data transfer is enabled. Since only one node is 
enabled at a time, the correct value will be driven onto the data bus or channel. 

Those of skill in the art will appreciate that pre-charging via an active transistor 
or other switching device is preferable, since it actively drives the un-driven bus to a 

10 known (high or low) voltage far more quickly (albeit a slowly decaying voltage) for 

operation of high-speed systems, thus ensuring a known logic state early in a given data 
cycle. This pre-charging is illustrated by way of transistors 30 and 34 in Fig. 1, whereas 
the less-preferred alternative pull-up or pull-down resistor approach is illustrated by way 
of phantom resistors 30' and 34\ It will be appreciated that typically only one of pre- 

15 charge transistor 30 and 34, gated by a negative-going precharge signal PRECHG-, as 
shown, (or alternatively only one of pull-up and pull-down resistors 30* and 34*) is 
present, depending upon the desired high-active or low-active logic state. 

Turning now to Fig. 2, the timing of the various signals of combinatorial circuits 
18 and 20 will be described. Those of skill in the art will appreciate that the rise and fall 

20 times are exaggerated in Fig. 2 to better illustrate various sequencing and set-up 

requirements. The bottom trace in Fig. 2 is a SYSCLK waveform, a system clock that is 
provided by a system utilizing the invention for synchronization purposes. The next 
trace up is a DATA TIMESLOT waveform, with each bit of data being supplied in 
succession within timeslots h, i, j, k, 1 and m. Those of skill will appreciate that such 

25 successive timeslots are enabled, in accordance with the invention, by timeslot generator 
26. 

The next trace up in Fig. 2 is a DATUMm ENABLE waveform, wherein the 
enable signal corresponding with timeslot j of interest is asserted nearly a full timeslot 
earlier at timeslot i. This is so that sufficient setup time for the enable signal to be 
30 propagated to the input of AND gate 18 within combinatorial circuit 12, as described by 
reference to Fig. 1. Those of skill in the art will appreciate that the shading to either 
side of the enable signal waveform represents a don't care condition for DATUMm 
ENABLE. The next trace up is a DATUMm VALUE waveform, the datum input to 
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buffer/driver 22 of combinatorial circuit 12. Again, the shading to either side represents 
a don't care condition for DATUMm VALUE. 

Finally in Fig. 2, the top trace is a DATA OUTPUT waveform corresponding 
generally with wired-OR junction 16 and illustrating the succession of bits 
5 corresponding with timeslots h, i, j, k, 1 and m on the channel. A vertical dotted line in 
Fig. 2 indicated by an arrow represents the point in time of interest at which DATUMm 
is enabled onto the time-multiplexed bus during TIMESLOTj. The datum is enabled 
onto the bus on the falling edge of S YSCLK and is captured at the receiver on the next 
rising edge of S YSCLK (as suggested by the bubble SYSCLK input to AND gates 18 
10 and 20 in Fig. 1). Those of skill in the art will appreciate that the PRECHARGE 

voltage segments in between the bits is to illustrate that buff er/dri vers 22 and 24, are tri- 
4f state buffers/drivers and that the bus is pulled up to a logic 1 or down to a logic 0 when 

Nj not driven by one of the plural data nodes. Importantly, it may be seen from Fig. 2 that 

gi a datum is supplied over the common time-multiplexed data channel synchronously at 

j£j 15 the speed of SYSCLK, within a given timeslot. 

%Q Another alternative solution, within the spirit and scope of the invention, is to 

use open-source drivers to pull up the data bus value when driven and a resistive pull- 
!r: down or an active pre-charge of the bus to a low state. 

m This basic scheme can be used with wide data buses 32-bit, 64-bit, etc., or with a 

s « 20 single wire serial data channel or an optical data channel. In the case of a single wire 

serial data channel, it is often desirable to have multiple system components that drive a 
single data channel at different times to enable the packing of distributed data into a 
single data packet that is to be sent over the data channel. The data channel can be the 
wired-OR of all of the outputs of the system components driving the data channel. This 
25 method allows different system components to drive the data channel on consecutive 
timeslots without the need for framing bit or blank timeslots between different driving 
components, as has been done before. Those of skill in the art will appreciate that the 
timeslots typically are assigned in accordance with a given set of system requirements 
so that all producers and all consumers of data know which timeslot belongs to which. 
30 The present invention might also be useful in the area of telecommunications 

where serial data from a variety of different sources are mixed together into a single 
higher speed serial data channel (refer to the cited British Telecom patent). Normally, 
all of the data must be selected onto a distributed multiplexing node and then selected 
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onto a single data channel. By allowing a set of system components to mix their data 
directly on a serial data channel, a less costly solution to the problem would be 
achieved. 

Having illustrated and described the principles of my invention in a preferred 
embodiment thereof, it should be readily apparent to those skilled in the art that the 
invention can be modified in arrangement and detail without departing from such 
principles. We claim all modifications coming within the spirit and scope of the 
accompanying claims. 
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